Addressing the class imbalance problem in network intrusion detection systems using data resampling and deep learning

نویسندگان

چکیده

Abstract Network intrusion detection systems (NIDS) are the most common tool used to detect malicious attacks on a network. They help prevent ever-increasing different and provide better security for NIDS classified into signature-based anomaly-based detection. The type of is which based machine learning models able with high accuracy. However, in recent years, has achieved even results detecting already known novel adoption deep models. Benchmark datasets try simulate real-network traffic by including more normal samples than attack samples. This causes training data be imbalanced difficulties certain types NIDS. In this paper, resampling technique proposed Adaptive Synthetic (ADASYN) Tomek Links algorithms combination mitigate class imbalance problem. model evaluated benchmark NSL-KDD dataset using accuracy, precision, recall F-score metrics. experimental show that binary classification, method improves performance outperforms state-of-the-art an accuracy 99.8%. multi-class were also improved, outperforming 99.98%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Addressing the Class Imbalance Problem in Medical Datasets

A well balanced dataset is very important for creating a good prediction model. Medical datasets are often not balanced in their class labels. Most existing classification methods tend to perform poorly on minority class examples when the dataset is extremely imbalanced. This is because they aim to optimize the overall accuracy without considering the relative distribution of each class. In thi...

متن کامل

Resampling Forgery Detection Using Deep Learning and A-Contrario Analysis

The amount of digital imagery recorded has recently grown exponentially, and with the advancement of software, such as Photoshop or Gimp, it has become easier to manipulate images. However, most images on the internet have not been manipulated and any automated manipulation detection algorithm must carefully control the false alarm rate. In this paper we discuss a method to automatically detect...

متن کامل

Addressing imbalance in multilabel classification: Measures and random resampling algorithms

The purpose of this paper is to analyze the imbalanced learning task in the multilabel scenario, aiming to accomplish two different goals. The first one is to present specialized measures directed to assess the imbalance level in multilabel datasets (MLDs). Using these measures we will be able to conclude which MLDs are imbalanced, and therefore would need an appropriate treatment. The second o...

متن کامل

Learning Greek Verb Complements: Addressing the Class Imbalance

Imbalanced training sets, where one class is heavily underrepresented compared to the others, have a bad effect on the classification of rare class instances. We apply One-sided Sampling for the first time to a lexical acquisition task (learning verb complements from Modern Greek corpora) to remove redundant and misleading training examples of verb nondependents and thereby balance our training...

متن کامل

Class Imbalance Problem in Data Mining using Probabilistic Approach

Class imbalance problem are raised when one class having maximum number of examples than other classes. The classical classifiers of balance datasets cannot deal with the class imbalance problem because they pay more attention to the majority class. The main drawback associated with it majority class is loss of important information. The Class imbalance problem is a difficult due to the amount ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Journal of Supercomputing

سال: 2023

ISSN: ['0920-8542', '1573-0484']

DOI: https://doi.org/10.1007/s11227-023-05073-x